International Journal of 
PARALLEL PROGRAMMING 


Vol. 31, Number 1 February 2003 





CONTENTS 


Special Issue: International Symposium on High Performance 
Computing 2002 


Guest Editor: Kazuki Joe 


Guest Editor’s Introduction 
Kazuki Joe 


Exploiting Distributed-Memory and Shared-Memory Parallelism on 
Clusters of SMPs with Data Parallel Programs 
Siegfried Benkner and Viera Sipkova 


Parallel Merge Sort with Load Balancing 
Minsoo Jeon and Dongseung Kim 


Performance Analysis Integration in the Uintah Software Development 
Cycle 
J. Davison de St. Germain, Alan Morris, Steven G. Parker, 
Allen D. Malony, and Sameer Shende 


Block Red-Black Ordering: A New Ordering Strategy for Parallelization 
of ICCG Method 
Takeshi Iwashita and Masaaki Shimasaki 








International Journal of 
PARALLEL PROGRAMMING 


Vol. 31, Number 2 April 2003 





CONTENTS 


Non-Strict Execution in Parallel and Distributed Computing 
Alfredo Cristobal-Salas, Andrei Tchernykh, Jean-Luc Gaudiot, 
and Wen- Yen Lin 


An Extended ANSI C for Processors with a Multimedia Extension 107 
Patricio Bulié and Veselko Gustin 


Alloyed Branch History: Combining Global and Local Branch History 
for Robust Performance EST 
Zhijian Lu, John Lach, Mircea R. Stan, and Kevin Skadron 








International Journal of 
PARALLEL PROGRAMMING 


Vol. 31, Number 3 June 2003 





CONTENTS 
Special Issue: OpenMP: Experiences and Implementations 


Guest Editor: Eduard Ayguade 


Erratum 


Guest Editor’s Introduction 
Eduard Ayguade 


Performance Evaluation of the Hitachi SR8000 Using SPEC 
OMP2001 Benchmarks 
Daisuke Takahashi, Mitsuhisa Sato, and Taisuke Boku 


Large System Performance of SPEC OMP Benchmark Suites 
Hideki Saito, Greg Gaertner, Wesley Jones, Rudolf Eigenmann, 
Hidetoshi Iwashita, Ron Lieberman, Matthijs van Waveren, 
and Brian Whitney 


Static Coarse Grain Task Scheduling with Cache Optimization Using 
OpenMP 
Hirofumi Nakano, Kazuhisa Ishizaka, Motoki Obata, 
Keiji Kimura, and Hironori Kasahara 


Optimizing OpenMP Programs on Software Distributed Shared 
Memory Systems 
Seung-Jai Min, Ayon Basumallik, and Rudolf Eigenmann 








International Journal of 
PARALLEL PROGRAMMING 


Vol. 31, Number 4 August 2003 





CONTENTS 


Hybrid Analysis: Static & Dynamic Memory Reference Analysis 
Silvius Rus, Lawrence Rauchwerger, and Jay Hoeflinger 


A Network-Failure-Tolerant Message-Passing System for Terascale 
Clusters 
Richard L. Graham, Sung-Eun Choi, David J. Daniel, 
Nehal N. Desai, Ronald G. Minnich, Craig E. Rasmussen, 
L. Dean Risinger, and Mitchel W. Sukalski 


Restructuring Computations for Temporal Data Cache Locality 
Venkata K. Pingali, Sally A. McKee, Wilson C. Hsieh, and 
John B. Carter 








International Journal of 
PARALLEL PROGRAMMING 


Vol. 31, Number 5 October 2003 





CONTENTS 


Time Optimal Software Pipelining of Loops with Control Flows 
Han-Saem Yun, Jihong Kim, and Soo-Mook Moon 
On the Performance of Randomized Embedding of Reproduction 
Trees in Static Networks 393 
Kegin Li 








International Journal of 
PARALLEL PROGRAMMING 


Vol. 31, Number 6 December 2003 





CONTENTS 


Special Issue: Workshop on Application Specific Processors (WASP) 
Guest Editor: Alex Orailoglu 


Guest Editor’s Introduction 
Alex Orailoglu 


Automatic Application-Specific Instruction-Set Extensions 
under Microarchitectural Constraints 
Kubilay Atasu, Laura Pozzi, and Paolo Ienne 


Automatic Design of Application Specific Instruction Set 
Extensions through Dataflow Graph Exploration 
Nathan Clark, Hongtao Zhong, Wilkin Tang, and Scott Mahlke 
Power-Aware Compilation for Register File Energy Reduction 
José L. Ayala, Alexander Veidenbaum, and Marisa Lopez-Vallejo 


On the Effectiveness of Flow Aggregation in Improving 
Instruction Reuse in Network Processing Applications 
G. Surendra, S. Banerjee, and S. K. Nandy 


A Reconfigurable Logic-Based Processor for the SCAN 
Image and Video Encryption Algorithm 
C. Kachris, N. Bourbakis, and A. Dollas 











